# Image Content Understanding
Typhoon2 Qwen2vl 7b Vision Instruct
Apache-2.0
Typhoon2-Vision is a Thai-supported visual language model capable of processing image and video inputs, specifically optimized for image-based applications.
Text-to-Image
Transformers Supports Multiple Languages

T
scb10x
793
11
Vision 8B MiniCPM 2 5 Uncensored And Detailed 4bit
The int4 quantized version of MiniCPM-Llama3-V 2.5, significantly reducing GPU VRAM usage (approximately 9GB)
Text-to-Image
Transformers

V
sdasd112132
330
30
Minicpm Llama3 V 2 5 Int4
The int4 quantized version of MiniCPM-Llama3-V 2.5 significantly reduces GPU VRAM usage to approximately 9GB, suitable for visual question answering tasks.
Text-to-Image
Transformers

M
openbmb
17.97k
73
Tinyllava 1.1b V0.1
Apache-2.0
A lightweight visual question answering model based on TinyLlama-1.1B, trained using the BakLlava codebase
Text-to-Image
Transformers

T
0xAmey
16
21
Featured Recommended AI Models